Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 2988650 |
| Missing cells | 1876559 |
| Missing cells (%) | 4.8% |
| Duplicate rows | 37207 |
| Duplicate rows (%) | 1.2% |
| Total size in memory | 925.2 MiB |
| Average record size in memory | 324.6 B |
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 4 |
| Text | 1 |
| Dataset has 37207 (1.2%) duplicate rows | Duplicates |
brand is highly overall correlated with cat1 and 1 other fields | High correlation |
cat1 is highly overall correlated with brand and 1 other fields | High correlation |
cat2 is highly overall correlated with brand and 1 other fields | High correlation |
cust_request_tn is highly overall correlated with customer_id and 2 other fields | High correlation |
customer_id is highly overall correlated with cust_request_tn and 1 other fields | High correlation |
product_id is highly overall correlated with cust_request_tn and 2 other fields | High correlation |
sku_size is highly overall correlated with product_id | High correlation |
tn is highly overall correlated with cust_request_tn and 2 other fields | High correlation |
plan_precios_cuidados is highly imbalanced (91.0%) | Imbalance |
stock_final has 1839319 (61.5%) missing values | Missing |
cust_request_tn is highly skewed (γ1 = 37.70988076) | Skewed |
tn is highly skewed (γ1 = 37.87580231) | Skewed |
stock_final has 34082 (1.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-19 03:00:46.825437 |
|---|---|
| Analysis finished | 2025-05-19 03:03:15.286250 |
| Duration | 2 minutes and 28.46 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
periodo
Date
| Distinct | 36 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.8 MiB |
| Minimum | 2017-01-01 00:00:00 |
|---|---|
| Maximum | 2019-12-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
customer_id
Real number (ℝ)
High correlation 
| Distinct | 597 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10171.395 |
| Minimum | 10001 |
|---|---|
| Maximum | 10637 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | 10001 |
|---|---|
| 5-th percentile | 10007 |
| Q1 | 10053 |
| median | 10133 |
| Q3 | 10267 |
| 95-th percentile | 10447 |
| Maximum | 10637 |
| Range | 636 |
| Interquartile range (IQR) | 214 |
Descriptive statistics
| Standard deviation | 142.03964 |
|---|---|
| Coefficient of variation (CV) | 0.013964617 |
| Kurtosis | -0.22512637 |
| Mean | 10171.395 |
| Median Absolute Deviation (MAD) | 98 |
| Skewness | 0.81392422 |
| Sum | 3.0398741 × 1010 |
| Variance | 20175.26 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10001 | 25122 | 0.8% |
| 10004 | 24487 | 0.8% |
| 10003 | 24100 | 0.8% |
| 10002 | 23553 | 0.8% |
| 10007 | 23469 | 0.8% |
| 10018 | 22645 | 0.8% |
| 10027 | 22634 | 0.8% |
| 10059 | 21983 | 0.7% |
| 10005 | 21518 | 0.7% |
| 10034 | 19705 | 0.7% |
| Other values (587) | 2759434 |
| Value | Count | Frequency (%) |
| 10001 | 25122 | |
| 10002 | 23553 | |
| 10003 | 24100 | |
| 10004 | 24487 | |
| 10005 | 21518 | |
| 10006 | 18419 | |
| 10007 | 23469 | |
| 10008 | 9192 | 0.3% |
| 10009 | 16749 | |
| 10010 | 11464 |
| Value | Count | Frequency (%) |
| 10637 | 2 | < 0.1% |
| 10636 | 5 | < 0.1% |
| 10635 | 51 | < 0.1% |
| 10634 | 16 | < 0.1% |
| 10633 | 2 | < 0.1% |
| 10632 | 2 | < 0.1% |
| 10631 | 21 | < 0.1% |
| 10630 | 65 | < 0.1% |
| 10629 | 8 | < 0.1% |
| 10626 | 187 |
product_id
Real number (ℝ)
High correlation 
| Distinct | 1233 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20423.189 |
| Minimum | 20001 |
|---|---|
| Maximum | 21299 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | 20001 |
|---|---|
| 5-th percentile | 20023 |
| Q1 | 20155 |
| median | 20360 |
| Q3 | 20650 |
| 95-th percentile | 21014 |
| Maximum | 21299 |
| Range | 1298 |
| Interquartile range (IQR) | 495 |
Descriptive statistics
| Standard deviation | 312.94155 |
|---|---|
| Coefficient of variation (CV) | 0.015322854 |
| Kurtosis | -0.6238578 |
| Mean | 20423.189 |
| Median Absolute Deviation (MAD) | 237 |
| Skewness | 0.59229109 |
| Sum | 6.1037765 × 1010 |
| Variance | 97932.413 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20037 | 9996 | 0.3% |
| 20100 | 9812 | 0.3% |
| 20020 | 9706 | 0.3% |
| 20230 | 9360 | 0.3% |
| 20010 | 9222 | 0.3% |
| 20021 | 8782 | 0.3% |
| 20105 | 8204 | 0.3% |
| 20022 | 8008 | 0.3% |
| 20111 | 7973 | 0.3% |
| 20122 | 7950 | 0.3% |
| Other values (1223) | 2899637 |
| Value | Count | Frequency (%) |
| 20001 | 6172 | |
| 20002 | 6000 | |
| 20003 | 6793 | |
| 20004 | 7139 | |
| 20005 | 5911 | |
| 20006 | 6497 | |
| 20007 | 6906 | |
| 20008 | 6453 | |
| 20009 | 5596 | |
| 20010 | 9222 |
| Value | Count | Frequency (%) |
| 21299 | 1 | |
| 21298 | 1 | |
| 21297 | 1 | |
| 21296 | 1 | |
| 21295 | 1 | |
| 21294 | 1 | |
| 21293 | 1 | |
| 21292 | 1 | |
| 21291 | 1 | |
| 21290 | 2 |
plan_precios_cuidados
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 165.3 MiB |
| 0 | |
|---|---|
| 1 | 34029 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2954621 | |
| 1 | 34029 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2954621 | |
| 1 | 34029 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2954621 | |
| 1 | 34029 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2988650 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2954621 | |
| 1 | 34029 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2988650 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2954621 | |
| 1 | 34029 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2988650 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2954621 | |
| 1 | 34029 | 1.1% |
cust_request_qty
Real number (ℝ)
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1495019 |
| Minimum | 1 |
|---|---|
| Maximum | 92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 92 |
| Range | 91 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.5805763 |
|---|---|
| Coefficient of variation (CV) | 1.6657702 |
| Kurtosis | 54.039778 |
| Mean | 2.1495019 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.3281174 |
| Sum | 6424109 |
| Variance | 12.820527 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2066593 | |
| 2 | 451619 | 15.1% |
| 3 | 152223 | 5.1% |
| 4 | 83170 | 2.8% |
| 5 | 47223 | 1.6% |
| 6 | 32179 | 1.1% |
| 7 | 23616 | 0.8% |
| 8 | 18792 | 0.6% |
| 9 | 14345 | 0.5% |
| 10 | 11975 | 0.4% |
| Other values (74) | 86915 | 2.9% |
| Value | Count | Frequency (%) |
| 1 | 2066593 | |
| 2 | 451619 | 15.1% |
| 3 | 152223 | 5.1% |
| 4 | 83170 | 2.8% |
| 5 | 47223 | 1.6% |
| 6 | 32179 | 1.1% |
| 7 | 23616 | 0.8% |
| 8 | 18792 | 0.6% |
| 9 | 14345 | 0.5% |
| 10 | 11975 | 0.4% |
| Value | Count | Frequency (%) |
| 92 | 1 | |
| 90 | 1 | |
| 88 | 1 | |
| 85 | 2 | |
| 84 | 1 | |
| 83 | 1 | |
| 79 | 1 | |
| 78 | 1 | |
| 77 | 1 | |
| 76 | 1 |
cust_request_tn
Real number (ℝ)
High correlation  Skewed 
| Distinct | 101954 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.47691053 |
| Minimum | 0.0001 |
|---|---|
| Maximum | 551.56137 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | 0.0001 |
|---|---|
| 5-th percentile | 0.00209 |
| Q1 | 0.01057 |
| median | 0.04095 |
| Q3 | 0.1638 |
| 95-th percentile | 1.604051 |
| Maximum | 551.56137 |
| Range | 551.56127 |
| Interquartile range (IQR) | 0.15323 |
Descriptive statistics
| Standard deviation | 3.276818 |
|---|---|
| Coefficient of variation (CV) | 6.8709283 |
| Kurtosis | 2789.3242 |
| Mean | 0.47691053 |
| Median Absolute Deviation (MAD) | 0.03658 |
| Skewness | 37.709881 |
| Sum | 1425318.6 |
| Variance | 10.737536 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01638 | 19921 | 0.7% |
| 0.04095 | 16229 | 0.5% |
| 0.00218 | 15964 | 0.5% |
| 0.00819 | 15171 | 0.5% |
| 0.0819 | 14603 | 0.5% |
| 0.00983 | 14272 | 0.5% |
| 0.03276 | 14052 | 0.5% |
| 0.02457 | 13014 | 0.4% |
| 0.01092 | 12684 | 0.4% |
| 0.00491 | 12504 | 0.4% |
| Other values (101944) | 2840236 |
| Value | Count | Frequency (%) |
| 0.0001 | 170 | < 0.1% |
| 0.00013 | 79 | < 0.1% |
| 0.00018 | 159 | < 0.1% |
| 0.0002 | 238 | < 0.1% |
| 0.00021 | 628 | |
| 0.00022 | 104 | < 0.1% |
| 0.00023 | 744 | |
| 0.00025 | 299 | |
| 0.00026 | 262 | < 0.1% |
| 0.00029 | 137 | < 0.1% |
| Value | Count | Frequency (%) |
| 551.56137 | 1 | |
| 510.65893 | 1 | |
| 444.41192 | 1 | |
| 439.90647 | 1 | |
| 437.37767 | 1 | |
| 416.64823 | 1 | |
| 407.02225 | 1 | |
| 393.26092 | 1 | |
| 389.02653 | 1 | |
| 384.82574 | 1 |
tn
Real number (ℝ)
High correlation  Skewed 
| Distinct | 101922 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.46684062 |
| Minimum | 0.0001 |
|---|---|
| Maximum | 547.87849 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | 0.0001 |
|---|---|
| 5-th percentile | 0.00209 |
| Q1 | 0.01052 |
| median | 0.0409 |
| Q3 | 0.1638 |
| 95-th percentile | 1.58995 |
| Maximum | 547.87849 |
| Range | 547.87839 |
| Interquartile range (IQR) | 0.15328 |
Descriptive statistics
| Standard deviation | 3.1598884 |
|---|---|
| Coefficient of variation (CV) | 6.7686664 |
| Kurtosis | 2850.397 |
| Mean | 0.46684062 |
| Median Absolute Deviation (MAD) | 0.03653 |
| Skewness | 37.875802 |
| Sum | 1395223.2 |
| Variance | 9.9848948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01638 | 19931 | 0.7% |
| 0.04095 | 16228 | 0.5% |
| 0.00218 | 15965 | 0.5% |
| 0.00819 | 15181 | 0.5% |
| 0.0819 | 14608 | 0.5% |
| 0.00983 | 14272 | 0.5% |
| 0.03276 | 14070 | 0.5% |
| 0.02457 | 13013 | 0.4% |
| 0.01092 | 12686 | 0.4% |
| 0.00491 | 12502 | 0.4% |
| Other values (101912) | 2840194 |
| Value | Count | Frequency (%) |
| 0.0001 | 170 | < 0.1% |
| 0.00013 | 79 | < 0.1% |
| 0.00018 | 159 | < 0.1% |
| 0.0002 | 238 | < 0.1% |
| 0.00021 | 628 | |
| 0.00022 | 104 | < 0.1% |
| 0.00023 | 746 | |
| 0.00025 | 299 | |
| 0.00026 | 262 | < 0.1% |
| 0.00029 | 137 | < 0.1% |
| Value | Count | Frequency (%) |
| 547.87849 | 1 | |
| 469.45761 | 1 | |
| 439.90647 | 1 | |
| 437.37767 | 1 | |
| 430.90803 | 1 | |
| 414.05146 | 1 | |
| 389.02653 | 1 | |
| 386.60688 | 1 | |
| 384.82574 | 1 | |
| 379.4427 | 1 |
cat1
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7448 |
| Missing (%) | 0.2% |
| Memory size | 169.8 MiB |
| PC | |
|---|---|
| HC | |
| FOODS | |
| REF | 6179 |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.576822 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HC |
|---|---|
| 2nd row | HC |
| 3rd row | HC |
| 4th row | HC |
| 5th row | HC |
Common Values
| Value | Count | Frequency (%) |
| PC | 1657313 | |
| HC | 746562 | |
| FOODS | 571148 | 19.1% |
| REF | 6179 | 0.2% |
| (Missing) | 7448 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pc | 1657313 | |
| hc | 746562 | |
| foods | 571148 | 19.2% |
| ref | 6179 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 2403875 | |
| P | 1657313 | |
| O | 1142296 | |
| H | 746562 | 9.7% |
| F | 577327 | 7.5% |
| D | 571148 | 7.4% |
| S | 571148 | 7.4% |
| R | 6179 | 0.1% |
| E | 6179 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7682027 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2403875 | |
| P | 1657313 | |
| O | 1142296 | |
| H | 746562 | 9.7% |
| F | 577327 | 7.5% |
| D | 571148 | 7.4% |
| S | 571148 | 7.4% |
| R | 6179 | 0.1% |
| E | 6179 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7682027 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 2403875 | |
| P | 1657313 | |
| O | 1142296 | |
| H | 746562 | 9.7% |
| F | 577327 | 7.5% |
| D | 571148 | 7.4% |
| S | 571148 | 7.4% |
| R | 6179 | 0.1% |
| E | 6179 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7682027 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 2403875 | |
| P | 1657313 | |
| O | 1142296 | |
| H | 746562 | 9.7% |
| F | 577327 | 7.5% |
| D | 571148 | 7.4% |
| S | 571148 | 7.4% |
| R | 6179 | 0.1% |
| E | 6179 | 0.1% |
cat2
Categorical
High correlation 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7448 |
| Missing (%) | 0.2% |
| Memory size | 184.4 MiB |
| CABELLO | |
|---|---|
| DEOS | |
| SOPAS Y CALDOS | |
| ROPA LAVADO | |
| HOGAR | |
| Other values (10) |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 7.6951468 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VAJILLA |
|---|---|
| 2nd row | VAJILLA |
| 3rd row | VAJILLA |
| 4th row | VAJILLA |
| 5th row | VAJILLA |
Common Values
| Value | Count | Frequency (%) |
| CABELLO | 813398 | |
| DEOS | 510270 | |
| SOPAS Y CALDOS | 344693 | |
| ROPA LAVADO | 266667 | 8.9% |
| HOGAR | 223478 | 7.5% |
| PIEL2 | 209945 | 7.0% |
| ADEREZOS | 204671 | 6.8% |
| VAJILLA | 155239 | 5.2% |
| PIEL1 | 90819 | 3.0% |
| ROPA ACONDICIONADOR | 82492 | 2.8% |
| Other values (5) | 79530 | 2.7% |
Length
| Value | Count | Frequency (%) |
| cabello | 813398 | |
| deos | 510270 | |
| ropa | 359332 | |
| sopas | 344693 | |
| y | 344693 | |
| caldos | 344693 | |
| lavado | 266667 | 6.6% |
| hogar | 223478 | 5.5% |
| piel2 | 209945 | 5.2% |
| aderezos | 204671 | 5.1% |
| Other values (8) | 408080 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 3375272 | |
| A | 3360801 | |
| L | 2890792 | |
| E | 2081347 | |
| S | 1789490 | |
| D | 1524166 | |
| C | 1333248 | 5.8% |
| 1048718 | 4.6% | |
| P | 1013302 | 4.4% |
| R | 900270 | 3.9% |
| Other values (14) | 3623381 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 21591305 | |
| Space Separator | 1048718 | 4.6% |
| Decimal Number | 300764 | 1.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 3375272 | |
| A | 3360801 | |
| L | 2890792 | |
| E | 2081347 | |
| S | 1789490 | |
| D | 1524166 | |
| C | 1333248 | 6.2% |
| P | 1013302 | 4.7% |
| R | 900270 | 4.2% |
| B | 813398 | 3.8% |
| Other values (11) | 2509219 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 209945 | |
| 1 | 90819 |
Space Separator
| Value | Count | Frequency (%) |
| 1048718 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21591305 | |
| Common | 1349482 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 3375272 | |
| A | 3360801 | |
| L | 2890792 | |
| E | 2081347 | |
| S | 1789490 | |
| D | 1524166 | |
| C | 1333248 | 6.2% |
| P | 1013302 | 4.7% |
| R | 900270 | 4.2% |
| B | 813398 | 3.8% |
| Other values (11) | 2509219 |
Common
| Value | Count | Frequency (%) |
| 1048718 | ||
| 2 | 209945 | 15.6% |
| 1 | 90819 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22940787 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 3375272 | |
| A | 3360801 | |
| L | 2890792 | |
| E | 2081347 | |
| S | 1789490 | |
| D | 1524166 | |
| C | 1333248 | 5.8% |
| 1048718 | 4.6% | |
| P | 1013302 | 4.4% |
| R | 900270 | 3.9% |
| Other values (14) | 3623381 |
cat3
Text
| Distinct | 93 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7448 |
| Missing (%) | 0.2% |
| Memory size | 185.9 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 7.8008947 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cristalino |
|---|---|
| 2nd row | Cristalino |
| 3rd row | Cristalino |
| 4th row | Cristalino |
| 5th row | Cristalino |
| Value | Count | Frequency (%) |
| shampoo | 380777 | 10.8% |
| aero | 337515 | 9.6% |
| acondicionador | 308574 | 8.8% |
| polvo | 153986 | 4.4% |
| liquido | 126363 | 3.6% |
| sopas | 121289 | 3.4% |
| jabon | 110439 | 3.1% |
| mayonesa | 108842 | 3.1% |
| gel | 102749 | 2.9% |
| noaero | 81470 | 2.3% |
| Other values (88) | 1694129 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2097836 | 9.0% |
| O | 1879074 | 8.1% |
| A | 1782653 | 7.7% |
| a | 1527350 | 6.6% |
| e | 1131100 | 4.9% |
| C | 992719 | 4.3% |
| r | 987568 | 4.2% |
| S | 848721 | 3.6% |
| N | 791113 | 3.4% |
| l | 789103 | 3.4% |
| Other values (41) | 10428806 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11379031 | |
| Lowercase Letter | 11332081 | |
| Space Separator | 544931 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2097836 | |
| a | 1527350 | |
| e | 1131100 | |
| r | 987568 | |
| l | 789103 | 7.0% |
| s | 741838 | 6.5% |
| i | 662332 | 5.8% |
| n | 564373 | 5.0% |
| u | 431503 | 3.8% |
| d | 397628 | 3.5% |
| Other values (15) | 2001450 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 1879074 | |
| A | 1782653 | |
| C | 992719 | |
| S | 848721 | |
| N | 791113 | |
| I | 750823 | 6.6% |
| D | 719180 | 6.3% |
| P | 684344 | 6.0% |
| M | 646255 | 5.7% |
| R | 569156 | 5.0% |
| Other values (15) | 1714993 |
Space Separator
| Value | Count | Frequency (%) |
| 544931 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22711112 | |
| Common | 544931 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2097836 | 9.2% |
| O | 1879074 | 8.3% |
| A | 1782653 | 7.8% |
| a | 1527350 | 6.7% |
| e | 1131100 | 5.0% |
| C | 992719 | 4.4% |
| r | 987568 | 4.3% |
| S | 848721 | 3.7% |
| N | 791113 | 3.5% |
| l | 789103 | 3.5% |
| Other values (40) | 9883875 |
Common
| Value | Count | Frequency (%) |
| 544931 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23205927 | |
| None | 50116 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2097836 | 9.0% |
| O | 1879074 | 8.1% |
| A | 1782653 | 7.7% |
| a | 1527350 | 6.6% |
| e | 1131100 | 4.9% |
| C | 992719 | 4.3% |
| r | 987568 | 4.3% |
| S | 848721 | 3.7% |
| N | 791113 | 3.4% |
| l | 789103 | 3.4% |
| Other values (40) | 10378690 |
None
| Value | Count | Frequency (%) |
| ñ | 50116 |
brand
Categorical
High correlation 
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7448 |
| Missing (%) | 0.2% |
| Memory size | 180.5 MiB |
| NIVEA | |
|---|---|
| SHAMPOO3 | |
| MAGGI | |
| DEOS1 | |
| MUSCULO | |
| Other values (32) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 6.3284487 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Importado |
|---|---|
| 2nd row | Importado |
| 3rd row | Importado |
| 4th row | Importado |
| 5th row | Importado |
Common Values
| Value | Count | Frequency (%) |
| NIVEA | 384335 | |
| SHAMPOO3 | 338209 | |
| MAGGI | 322839 | |
| DEOS1 | 299785 | |
| MUSCULO | 242680 | 8.1% |
| LIMPIEX | 217199 | 7.3% |
| SHAMPOO2 | 141777 | 4.7% |
| NATURA | 120648 | 4.0% |
| SHAMPOO1 | 109610 | 3.7% |
| COLBERT | 89406 | 3.0% |
| Other values (27) | 714714 |
Length
| Value | Count | Frequency (%) |
| nivea | 384335 | |
| shampoo3 | 338209 | |
| maggi | 322839 | |
| deos1 | 299785 | |
| musculo | 242680 | 8.1% |
| limpiex | 217199 | 7.3% |
| shampoo2 | 141777 | 4.8% |
| natura | 120648 | 4.0% |
| shampoo1 | 109610 | 3.7% |
| colbert | 89406 | 3.0% |
| Other values (27) | 714714 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 2259859 | |
| A | 2147786 | 11.4% |
| M | 1578291 | 8.4% |
| I | 1449134 | 7.7% |
| E | 1399987 | 7.4% |
| S | 1373798 | 7.3% |
| P | 948177 | 5.0% |
| L | 781952 | 4.1% |
| N | 770778 | 4.1% |
| G | 727286 | 3.9% |
| Other values (25) | 5429336 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 17595246 | |
| Decimal Number | 1155634 | 6.1% |
| Lowercase Letter | 115504 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2259859 | |
| A | 2147786 | |
| M | 1578291 | 9.0% |
| I | 1449134 | 8.2% |
| E | 1399987 | 8.0% |
| S | 1373798 | 7.8% |
| P | 948177 | 5.4% |
| L | 781952 | 4.4% |
| N | 770778 | 4.4% |
| G | 727286 | 4.1% |
| Other values (15) | 4158198 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 28876 | |
| m | 14438 | |
| p | 14438 | |
| r | 14438 | |
| t | 14438 | |
| a | 14438 | |
| d | 14438 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 556839 | |
| 3 | 401281 | |
| 2 | 197514 | 17.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17710750 | |
| Common | 1155634 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 2259859 | |
| A | 2147786 | |
| M | 1578291 | 8.9% |
| I | 1449134 | 8.2% |
| E | 1399987 | 7.9% |
| S | 1373798 | 7.8% |
| P | 948177 | 5.4% |
| L | 781952 | 4.4% |
| N | 770778 | 4.4% |
| G | 727286 | 4.1% |
| Other values (22) | 4273702 |
Common
| Value | Count | Frequency (%) |
| 1 | 556839 | |
| 3 | 401281 | |
| 2 | 197514 | 17.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18866384 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 2259859 | |
| A | 2147786 | 11.4% |
| M | 1578291 | 8.4% |
| I | 1449134 | 7.7% |
| E | 1399987 | 7.4% |
| S | 1373798 | 7.3% |
| P | 948177 | 5.0% |
| L | 781952 | 4.1% |
| N | 770778 | 4.1% |
| G | 727286 | 3.9% |
| Other values (25) | 5429336 |
sku_size
Real number (ℝ)
High correlation 
| Distinct | 75 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7448 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 445.277 |
| Minimum | 1 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 90 |
| median | 240 |
| Q3 | 450 |
| 95-th percentile | 1000 |
| Maximum | 10000 |
| Range | 9999 |
| Interquartile range (IQR) | 360 |
Descriptive statistics
| Standard deviation | 741.1227 |
|---|---|
| Coefficient of variation (CV) | 1.6644082 |
| Kurtosis | 39.527448 |
| Mean | 445.277 |
| Median Absolute Deviation (MAD) | 160 |
| Skewness | 5.1284545 |
| Sum | 1.3274607 × 109 |
| Variance | 549262.86 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 306300 | 10.2% |
| 400 | 214118 | 7.2% |
| 350 | 200788 | 6.7% |
| 90 | 173479 | 5.8% |
| 50 | 155471 | 5.2% |
| 10 | 125059 | 4.2% |
| 750 | 122149 | 4.1% |
| 100 | 120666 | 4.0% |
| 300 | 103069 | 3.4% |
| 800 | 90204 | 3.0% |
| Other values (65) | 1369899 |
| Value | Count | Frequency (%) |
| 1 | 14556 | 0.5% |
| 2 | 21958 | 0.7% |
| 3 | 3010 | 0.1% |
| 4 | 20122 | 0.7% |
| 5 | 49197 | 1.6% |
| 6 | 14872 | 0.5% |
| 8 | 18309 | 0.6% |
| 10 | 125059 | |
| 12 | 30376 | 1.0% |
| 15 | 28855 | 1.0% |
| Value | Count | Frequency (%) |
| 10000 | 3052 | 0.1% |
| 7500 | 32 | < 0.1% |
| 5000 | 14532 | 0.5% |
| 4500 | 38 | < 0.1% |
| 4000 | 18498 | 0.6% |
| 3000 | 80012 | |
| 2000 | 7768 | 0.3% |
| 1800 | 508 | < 0.1% |
| 1500 | 12842 | 0.4% |
| 1400 | 3906 | 0.1% |
stock_final
Real number (ℝ)
Missing  Zeros 
| Distinct | 12596 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 1839319 |
| Missing (%) | 61.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.139183 |
| Minimum | -27.31136 |
|---|---|
| Maximum | 1562.0245 |
| Zeros | 34082 |
| Zeros (%) | 1.1% |
| Negative | 28122 |
| Negative (%) | 0.9% |
| Memory size | 22.8 MiB |
Quantile statistics
| Minimum | -27.31136 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.76106 |
| median | 7.17641 |
| Q3 | 23.05673 |
| 95-th percentile | 111.2202 |
| Maximum | 1562.0245 |
| Range | 1589.3358 |
| Interquartile range (IQR) | 21.29567 |
Descriptive statistics
| Standard deviation | 74.750983 |
|---|---|
| Coefficient of variation (CV) | 2.7543564 |
| Kurtosis | 114.17328 |
| Mean | 27.139183 |
| Median Absolute Deviation (MAD) | 6.70467 |
| Skewness | 8.9532795 |
| Sum | 31191904 |
| Variance | 5587.7094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 34082 | 1.1% |
| 0.049 | 727 | < 0.1% |
| 0.11394 | 521 | < 0.1% |
| 0.7204 | 470 | < 0.1% |
| 3.42342 | 468 | < 0.1% |
| -1.57248 | 450 | < 0.1% |
| 0.04423 | 447 | < 0.1% |
| 17.26234 | 445 | < 0.1% |
| -0.01747 | 440 | < 0.1% |
| 27.70186 | 432 | < 0.1% |
| Other values (12586) | 1110849 | |
| (Missing) | 1839319 |
| Value | Count | Frequency (%) |
| -27.31136 | 206 | |
| -13.66656 | 65 | < 0.1% |
| -13.33127 | 196 | |
| -8.19961 | 64 | < 0.1% |
| -8.15986 | 86 | < 0.1% |
| -7.7212 | 24 | < 0.1% |
| -5.86579 | 65 | < 0.1% |
| -5.28091 | 94 | < 0.1% |
| -5.18307 | 242 | |
| -5.0992 | 51 | < 0.1% |
| Value | Count | Frequency (%) |
| 1562.02448 | 221 | |
| 1284.38214 | 158 | |
| 1212.36734 | 158 | |
| 1146.09799 | 213 | |
| 1097.55623 | 149 | |
| 1057.38804 | 189 | |
| 1037.85386 | 186 | |
| 1031.01561 | 176 | |
| 978.16446 | 46 | < 0.1% |
| 916.3419 | 215 |
Interactions
Correlations
| brand | cat1 | cat2 | cust_request_qty | cust_request_tn | customer_id | plan_precios_cuidados | product_id | sku_size | stock_final | tn | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| brand | 1.000 | 1.000 | 0.834 | 0.012 | 0.018 | 0.039 | 0.228 | 0.339 | 0.265 | 0.137 | 0.018 |
| cat1 | 1.000 | 1.000 | 1.000 | 0.016 | 0.018 | 0.054 | 0.041 | 0.343 | 0.226 | 0.132 | 0.018 |
| cat2 | 0.834 | 1.000 | 1.000 | 0.012 | 0.016 | 0.037 | 0.120 | 0.264 | 0.375 | 0.118 | 0.015 |
| cust_request_qty | 0.012 | 0.016 | 0.012 | 1.000 | 0.376 | -0.452 | 0.003 | -0.008 | 0.009 | -0.010 | 0.376 |
| cust_request_tn | 0.018 | 0.018 | 0.016 | 0.376 | 1.000 | -0.512 | 0.000 | -0.592 | 0.472 | 0.324 | 1.000 |
| customer_id | 0.039 | 0.054 | 0.037 | -0.452 | -0.512 | 1.000 | 0.006 | -0.007 | -0.031 | -0.007 | -0.512 |
| plan_precios_cuidados | 0.228 | 0.041 | 0.120 | 0.003 | 0.000 | 0.006 | 1.000 | 0.066 | 0.019 | 0.012 | 0.000 |
| product_id | 0.339 | 0.343 | 0.264 | -0.008 | -0.592 | -0.007 | 0.066 | 1.000 | -0.552 | -0.443 | -0.592 |
| sku_size | 0.265 | 0.226 | 0.375 | 0.009 | 0.472 | -0.031 | 0.019 | -0.552 | 1.000 | 0.354 | 0.472 |
| stock_final | 0.137 | 0.132 | 0.118 | -0.010 | 0.324 | -0.007 | 0.012 | -0.443 | 0.354 | 1.000 | 0.324 |
| tn | 0.018 | 0.018 | 0.015 | 0.376 | 1.000 | -0.512 | 0.000 | -0.592 | 0.472 | 0.324 | 1.000 |
Missing values
Sample
| periodo | customer_id | product_id | plan_precios_cuidados | cust_request_qty | cust_request_tn | tn | cat1 | cat2 | cat3 | brand | sku_size | stock_final | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2017-01-01 | 10234 | 20524 | 0 | 2 | 0.05300 | 0.05300 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 1 | 2017-01-01 | 10032 | 20524 | 0 | 1 | 0.13628 | 0.13628 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 2 | 2017-01-01 | 10217 | 20524 | 0 | 1 | 0.03028 | 0.03028 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 3 | 2017-01-01 | 10125 | 20524 | 0 | 1 | 0.02271 | 0.02271 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 4 | 2017-01-01 | 10012 | 20524 | 0 | 11 | 1.54452 | 1.54452 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 5 | 2017-01-01 | 10080 | 20524 | 0 | 1 | 0.01514 | 0.01514 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 6 | 2017-01-01 | 10015 | 20524 | 0 | 4 | 0.10600 | 0.10600 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 7 | 2017-01-01 | 10062 | 20524 | 0 | 1 | 0.18928 | 0.18928 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 8 | 2017-01-01 | 10159 | 20524 | 0 | 3 | 0.02271 | 0.02271 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| 9 | 2017-01-01 | 10183 | 20524 | 0 | 1 | 0.01514 | 0.01514 | HC | VAJILLA | Cristalino | Importado | 500.0 | NaN |
| periodo | customer_id | product_id | plan_precios_cuidados | cust_request_qty | cust_request_tn | tn | cat1 | cat2 | cat3 | brand | sku_size | stock_final | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2988640 | 2019-12-01 | 10021 | 20853 | 0 | 8 | 0.15829 | 0.15829 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988641 | 2019-12-01 | 10093 | 20853 | 0 | 1 | 0.05574 | 0.05574 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988642 | 2019-12-01 | 10003 | 20853 | 0 | 9 | 0.62426 | 0.62426 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988643 | 2019-12-01 | 10367 | 20853 | 0 | 1 | 0.00446 | 0.00446 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988644 | 2019-12-01 | 10278 | 20853 | 0 | 5 | 0.06020 | 0.06020 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988645 | 2019-12-01 | 10105 | 20853 | 0 | 1 | 0.02230 | 0.02230 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988646 | 2019-12-01 | 10092 | 20853 | 0 | 1 | 0.00669 | 0.00669 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988647 | 2019-12-01 | 10006 | 20853 | 0 | 7 | 0.02898 | 0.02898 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988648 | 2019-12-01 | 10018 | 20853 | 0 | 4 | 0.01561 | 0.01561 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2988649 | 2019-12-01 | 10020 | 20853 | 0 | 2 | 0.01561 | 0.01561 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
Duplicate rows
Most frequently occurring
| periodo | customer_id | product_id | plan_precios_cuidados | cust_request_qty | cust_request_tn | tn | cat1 | cat2 | cat3 | brand | sku_size | stock_final | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2017-01-01 | 10001 | 20010 | 0 | 3 | 1.31914 | 1.31914 | HC | ROPA LAVADO | Polvo | LIMPIEX | 400.0 | NaN | 2 |
| 1 | 2017-01-01 | 10001 | 20021 | 0 | 3 | 1.87824 | 1.87824 | HC | ROPA LAVADO | Polvo | LIMPIEX | 400.0 | NaN | 2 |
| 2 | 2017-01-01 | 10001 | 20022 | 0 | 10 | 15.35789 | 15.35789 | HC | ROPA LAVADO | Polvo | LIMPIEX | 800.0 | NaN | 2 |
| 3 | 2017-01-01 | 10001 | 20037 | 0 | 6 | 5.40278 | 5.40278 | FOODS | SOPAS Y CALDOS | Caldo Cubo | MAGGI | 12.0 | NaN | 2 |
| 4 | 2017-01-01 | 10001 | 20105 | 0 | 8 | 6.95036 | 6.95036 | FOODS | SOPAS Y CALDOS | Salsas Wet | MAGGI | 350.0 | NaN | 2 |
| 5 | 2017-01-01 | 10002 | 20010 | 0 | 16 | 57.77117 | 56.09386 | HC | ROPA LAVADO | Polvo | LIMPIEX | 400.0 | NaN | 2 |
| 6 | 2017-01-01 | 10002 | 20020 | 0 | 12 | 29.24813 | 29.24813 | HC | ROPA LAVADO | Polvo | LIMPIEX | 800.0 | NaN | 2 |
| 7 | 2017-01-01 | 10002 | 20021 | 0 | 22 | 37.49491 | 36.21072 | HC | ROPA LAVADO | Polvo | LIMPIEX | 400.0 | NaN | 2 |
| 8 | 2017-01-01 | 10002 | 20022 | 0 | 11 | 42.00269 | 42.00269 | HC | ROPA LAVADO | Polvo | LIMPIEX | 800.0 | NaN | 2 |
| 9 | 2017-01-01 | 10002 | 20037 | 0 | 26 | 15.80998 | 15.11284 | FOODS | SOPAS Y CALDOS | Caldo Cubo | MAGGI | 12.0 | NaN | 2 |